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ABSTRACT 

The reionization of hydrogen in the intergalactic medium (IGM) is a crucial landmark in 
the history of the universe, but the processes through which it occurs remain mysterious. In 
particular, recent numerical and analytic work suggest that reionization by stellar sources is 
driven by large-scale density fluctuations and must be inhomogeneous on scales of many co- 
moving Mpc. We examine the prospects for constraining the topology of neutral and ionized 
gas through Lya absorption of high-redshift sources. One method is to search for gaps in the 
Gunn-Peterson absorption troughs of luminous sources. These could occur if the line of sight 
passes sufficiently close to the center of a large HII region. In contrast to previous work, we 
find a non-negligible (though still small) probability of observing such a gap before reioniza- 
tion is complete. In our model the transmission spike at z — 6.08 in the spectrum of SDSS 
Jl 148H-5251 does not necessarily require overlap to have been completed at an earlier epoch. 
We also examine the IGM damping wing absorption of the Lya emission lines of star- forming 
galaxies. Because most galaxies sit inside of large HII regions, we find that the severity of ab- 
sorption is significantly smaller than previously thought and decoupled from the properties of 
the observed galaxy. While this limits our ability to constrain the mean neutral fraction of the 
IGM from observations of individual galaxies, it presents the exciting possibility of measur- 
ing the size distribution and evolution of the ionized bubbles by examining the distribution of 
damping wing optical depths in a large sample of galaxies. 
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1 INTRODUCTION 

The reionization of the intergalactic medium (IGM) constitutes a 
milestone in the history of the universe, because it marks the epoch 
at which (radiative) feedback from the first generations of luminous 
objects transformed the universe on the largest scales. It therefore 
offers a fascinating probe of both the IGM and these first luminous 
sources, and astronomers have focussed a great deal of attention 
on understanding the transition. Intriguingly, three independent ob- 
servational techniques offer constraints that appear (at first sight) 
mutually contradictory. One comes from spectra of high-redshift 
quasars identified through the Sloan Digital Sky Survey (SDSS).^ 
All f our quasars with z > 6.2 show complete iGunn & PetersonI 
jl965i | ) absorption over substantial path lengths in their spectrum 
isecker et alJl200lt iFan et aljEooa IWhite et ajFlOOal iFan et alJ 
|2004); this implies a mean neutral fraction xh ^ W~'^ and a rapid 
change in the ionizing background at this epoch (Fan et al. 2002; 
but see lSongailal2004l for a different interpretation). Both of these 
properties indicate that the tail end of reionization occurs at z ~ 6 



* Email: sfurlane@tapir.caltech.edu 

^ See|http://www.sdss.org/|for more information on the SDSS. 



(see also* Wvithe & Lo eb"2004b^. A second constraint comes from 
observations of the cosmic microwave background (CMB). Primor- 
dial anisotropics are washed out by free electrons after reioniza- 
tion, but the same process generates a large-scale polarization sig- 
nal (Zaldarriaaa 1997). This provides an integral constraint on the 
reionization history; recent measurements imply that xh must have 
been small at 2 > 14 iKoeut et aL.2003-; .Soergel et aL.2003l) . Fi- 
nally, the relatively high temperature of the Ly g forest at z ~ 2-4 
suggests an order unity change in xh at z < 10 fTheuns et a lj2002t 
|Hui & Haiman 2003), although this conclusion is weakened by un- 
certainties about Hell reionization (e.g. JSokasian et alj2002h . 

These three observations imply that the reionization his- 
tory must have been complex and extended. Such a history is 
inconsistent with theoretical expectations for "normal" galaxies, 
which cause a simple tra nsition with a s moothly increasing ion- 
ized fraction (e.g., Barkan a & LoeblbOOlL and references therein). 
The observations appear to indicate qualitative evolution in the 
properties of the ionizing sources as well as the crucial i mpor- 
tance of feedback mechanisms tW vithe & Loeb 2003; Cen 2003^ 
iHaiman & Holdeill2003t ISokasian et aLi2003i) . Unfortunately, ex- 
tracting more detailed information about reionization appears diffi- 
cult. One strategy is "21 cm tomography" of the high-redshift IGM 
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(e.g. [Scott & Reed Il99(t iMadau et alJ Il997t IZaldarriaga eTal] 
I2OO4I Furlanetto et alJl2004al) . in which one maps the distribution 
of neutral hydrogen on large scales through its redshifted hyperfine 
transition. However, this technique requires new instruments and 
will likely not be possible for several years. Detailed analyses of 
the CMB may als o provide more information JHolder et alj|2003l : 
ISantos et all2003h but promise to be similarly difficult. 

Here we consider the study of reionization through surveys 
of high-redshift galaxies and quasars. Such observations obvi- 
ously help by me asuring the abunda nce and properties of lumi- 
nous sources (e.g., Barto n et all2004 l. Beyond this, however, Lya 
absorption by neutral hydrogen in the IGM can have powerful 
effects on the spectra of these sources JGunn & Peterson|ll965t 
lMiralda-Escude-1998.) . so detailed observations of their properties 
can also be used to constrain the IGM itself. The first possibil- 
ity is to examine the Lya emission lines of high-redshift galax- 
ies. Narrowband searches for such lines are, in fact, one of the 
most efficient techniques to i dentify these sources. Surveys cur- 
rently extend to z y 6-7 (e.g..lHu et all2002l:lKodaira et all2003l: 
iRhoads et ai]l2004 IStanwav et alJl2004bHSantos et alj|2004) and 
will likely reach higher redshifts in the coming years. Because the 
Lyof line is subject to strong absorption from the IGM, its charac- 
teristics can help to constrain the properties of the surrounding gas. 
One factor is that the damping wing absorption should make strong 
Lya emitters rarer before reionization, although Haiman (2002) 
showed that the ionized bubble around luminous galaxies will not 
necessarily completely destroy the line (see also _Madau_^^ees 
|2000| for a related discussion in the context of quasars). ISantosI 
y004!) made a thorough study of how local properties of the galaxy 
(such as winds and infall) would affect such inferences; these prop- 
erties can substantially affect the Lya absorption, but because they 
do not depend on the ionization state of the IGM they should not 
strongly affect how the relative absorption changes through reion- 
ization. Thus there remains the exciting possibility of constraining 
the IGM's ionization state through observations of high-redshift 
galaxies. For example, the recent detection of a possible 2: = 10 
galaxy bv lPello et alJ |2004|) triggered a flurry of theoretical efforts 
to constrain the ionization state of the IGM and the i onizing lu- 
minosity of the galaxy through its apparent absorption ( Loeb et al. 
[2004; Ricotti et al. 2004; Gnedin & Prada 2004; Cen et al. 2004). 
(Note that, although this object was not selected by its Lya line, its 
spectrum apparently nevertheless shows one.) 

However, none of these theoretical studies have explicitly con- 
sidered the possibility that neighboring galaxies can alter the lo- 
cal ionization state. Because high-redshift galaxies are so strongly 
clustered, any source is likely to have a number of nearby neigh- 
bors whose own HII regions may overlap with that of the detected 
galaxy even well before reionization is complete. In this case, 
the absorption could be much less than predicted by naive mod- 
els in which each galaxy is isolated. .Cea 1.2003a) suggested that 
this may be important near luminous quasars, but such objects are 
rare before reionization and are certainly not representative of the 
large-scale environment (Fan et al. 2004). Furlanetto et al. (2004b], 
hereafter FZH04) , guided by simulations ( Sokasian et al. 2003; 
ICiardi et alj|2003h and analytic work tBarkana & Loeb..2003.) . de- 
veloped a simple model to describe how large-scale density fluc- 
tuations drive the growth of HII regions. The model reproduces 
the qualitative features of numerical simulations of reionization, 
which contain a relatively small number of large HII regions 
(with sizes of several comoving Mpc) encompassing many ion- 
izing sources. These create a large-scale inhomogeneous pattern 
of neutral and ionized gas that evolves throughout reionization. 



Most luminous sources sit inside regions that are ionized relatively 
early, so they will suffer co rrespondingly less absorption from the 
IGM. lGnedin & P rada' ( 2004) used numerical simulations to exam- 
ine how the evolving bubble pattern affects the absorption suffered 
by individual galaxies, although they were able to consider only 
one particular reionization history and were limited by the finite 
size of their simulation box. In this paper, we will use the model 
of FZH04 to quantify the amount of Lya absorption expected in 
different stages of reionization. We will argue that the large HII re- 
gions cannot be neglected when interpreting Lya absorption and 
make it difficult to infer the mean neutral fraction from observa- 
tions of a small number of galaxies. However, these lines do present 
the intriguing possibility of allowing us to measure the size distri- 
bution of HII regions throughout reionization and thus to constrain 
the inhomogeneity of the process.^ 

Another technique is to study the absorption spectra of lu- 
minous quasars. At lower redshifts, these yield detailed informa- 
tion about the density structure of the IGM through the Lya for- 
est. Unfortunately, the depth of the Gunn-Peterson trough, which 
is TCP ~ 6.5 X 10^ [(1 + z)/10f^^ for a fully neutral medium, 
makes it more difficult to extract information as reionization is ap- 
proached. Before overlap, any transmission in the trough requires 
passing close to a bright ionizing source; it is particularly diffi- 
cult because of the broad Lya damping wing from the surrounding 
IGM. Nevertheless, the spectrum of SDSS 1148-1-5251 contains a 
transmission spike al z — 6.08 in both the Lya and Ly/3 troughs 
with an apparent r — 2.5 (White etal. 2003). Miralda-Esc udg 
( 1998) argued that evading the damping wing constraint required 
ionized bubbles with sizes > 1 physical Mpc and suggested that 
such r egions were only possible around bright quasars. Barkang 
(2002) showed explicitly that, if each galaxy is considered in iso- 
lation, transmission gaps in the Gunn-Peterson trough should be 
extremely rare until reionization is complete. However, these stud- 
ies did not include the biasing of sources and the correspondingly 
large HII regions. Here we use the model of FZH04 to compute 
the probability of seeing a transmission gap in the Gunn-Peterson 
trough. We show that the large ionized bubbles make the damping 
wing absorption considerably less significant, so there is a small 
(but non-negligible) possibility of observing such a gap even be- 
fore reionization is complete. We thus find that the transmission 
gap in the spectrum of SDSS Jl 148-I-525I does not require overlap 
to be complete by 2: = 6.08. 

In SJ2| we briefly review the model for reionization originally 
presented in FZH04. We then consider its implications for surveys 
of Lya emitting galaxies in f|3| and for quasar absorption spec- 
tra in fJ4| We conclude in iJ5| In our numerical calculations, we 
will assume a ACDM cosmology with Qrn = 0.3, JIa ~ 0.7, 
Qt = 0.046, H = lOO/i km s'^ Mpc"^ (with h = 0.7), n = 1, 
and (78 = 0.9, consistent with the most recent measurements 
l Spergeletal.2003i) . 



2 A MODEL FOR REIONIZATION 

Recent numerical simulations (e.g.. lSokasian et all2003^ show that 
reionization proceeds "inside-out" from high density clusters of 



^ As we were completing this work, we became aware of a similar effort to 
describe the HII regions around Lyo-line galaxies iWyithe & Loeb 2004qp. 
We note that both groups reach qualitatively similar results, although each 
include different aspects of the physics and utilise entirely independent 
methods. 
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sources to voids, at least when the source s resemble star-forming 
galaxies (e.g.,(SDrinsel & Hemauisll2003^ . We therefore associate 
HII regions with large-scale overdensities. We assume that a galaxy 
of mass rrigai can ionize a mass C'^gai, where (" is a constant that 
depends on the efficiency of ionizing photon production, the es- 
cape fraction of these photons from the host galaxy, the star for- 
mation efficiency, and the mean number of recombinations. Values 
of ^ ~ 10-40 are reasonable for normal star formation, but very 
massive stars c an increase the efficiency by an order of magnitude 
JBromm et alJ bOOl). The criterion for a region to be ionized by 
the galaxies contained inside it is then /coii > C^^^ where /coii is 
the fraction of mass bound to halos above some rrimin. We will as- 
sume that this minimum mass corresponds to a virial temperature 
of lO'* K, at which point hydrogen line cooling becomes effi cient. 
In the extended Press-Schechter model JLacev & Coleiri993h . this 
places a condition on the mean overdensity within a region of mass 



5m > 5.^{m,z) = 5,{z) ~ V2K{C)[g, 
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where K{C,) — erf^^(l — C~^)' "" ("*) 's the variance of density 
fluctuations on the scale m, 0-^;^ = cr^(mniin), and 5c{z) is the 
critical density for collapse. 

FZH04 showed how to construct the mass function of HII 
regions from 5x in an analogous way to the halo mass function 
( Press & Schechter 1974; Bondetal. 1991). The barrier in equa- 
tion Q is well approximated by a linear function in a^ , 5^ ~ 
B{m,z) = Bo + B\cj^{m). In that cas e the mass f unction of 
ionized bubbles has an analytic expression iShethll99a) : 
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where p is the mean density of the universe. Equation Q gives the 
comoving number density of HII regions with masses in the range 
(tti, m + dm). FigureQshows the resulting size distributions for 
( — 40. The dot-dashed, short-dashed, long-dashed, dotted, and 
solid curves correspond to z = 18, 16, 14, 13, and 12, respec- 
tively. We have normalized each curve by the fraction of space Q 
filled by the bubbles at the specified redshift (i.e., (3 = 1 — xh, 
where xh is the mean neutral fraction). The curves begin at the ra- 
dius corresponding to an HII region around a completely isolated 
galaxy of mass mmin- FigureQshows that the characteristic bubble 
size is much larger than this: when xh ~ 0.7 (0.25), the typical 
size is ~ 2 (16) comoving Mpc. Although we have shown the size 
distribution for a particular choice of (^, the characteristic scale of 
ionized bubbles depends primarily on zh and is nearly indepen- 
dent of C and rrimin', to first order, these quantities simply modify 
the time evolution of xh- The crucial difference between this for- 
mula and the standard Press-Schechter mass function arises from 
the fact that the barrier S^ is a (decreasing) function of m. The bar- 
rier is thus more difficult to cross as one approaches smaller scales, 
which imprints a characteristic size on the bubbles. In contrast, the 
barrier used in constructing the halo mass function, Sc{z), is in- 
dependent of mass, which yields the usual power law behavior at 
small masses. 

We refer the reader to FZH04 for a more in-depth discussion 
of this formalism. 




R (Mpc) 

Figure 1. The bubble size distribution (^"^Vdn/dlnR at several dif- 
ferent redshifts in our model, assuming f = 40 (note that R is the co- 
moving size). Dot-dashed, short-dashed, long-dashed, dotted, and solid 
lines are for z = 18, 16, 14, 13, and 12, respectively. These have Q = 
0.037, 0.11,0.3,0.5, and 0.74. 



3 LYo EMITTERS 

We will first consider the how the topology of HII regions affects 
galaxies with strong Lya emission lines. The essential idea is that 
absorption from the surrounding neutral gas will modify the ob- 
served line; in the simplest approximation, Lya lines will be much 
harder to see as xh increases because of the increased absorption. 
In i)3.1l we calculate the probability for a galaxy of a given mass to 
sit inside an ionized bubble of a given size. We will then describe in 
i|3.2l how the resulting distribution affects the prospects for surveys 
of Lya emitters. 



3.1 Galaxies and Their HII Regions 

We wish to know the probability p{mb\mh) that a halo of mass 
ruh sits inside of an HII region with mass mt (corresponding to 
a radius Rt)-'^ One of the advantages of using the excursion set 
formalism to model both the HII regions and the dark matter ha- 
los is that it is straightforward to cormect the two objects; the 
argument is similar to the treatment of non-gaussianity in §5 of 
FZH04 and to the derivation of the conditional halo mass func- 
tion I Lacev & Cole 1993). We wish to compute the probability that 
a trajectory crossing the halo mass function barrier 5c at a^ has 
previously crossed the ionization barrier Sx (or B in our approxi- 
mation) at (jf > cr^. (We will use the subscript "b" to refer to an 
ionized bubble and "h" to refer to the galaxy halo.) The distribution 
of first-crossings above 5c, p{5l^ = Sc) (here the "f" superscript 
indicates a first-crossing) corresponds to the halo mass function 
JPress & SchechteJl97llBond et alJl99lh . Similarly, the distribu- 
tion of first-crossings above B, p{5l = B), can be found directly 



•^ Note that our formalism is Lagrangian. When converting from rrif, to Ri,, 
we will assume that the HII regions have the mean density. At these early 
epochs and on large scales, this is a reasonable approximation. 
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from equation J2j. Moreover, given that a particle trajectory crosses 
B at o"5 (i.e., that this particle is part of an HII region of mass mj,), 
we can construct the conditional halo mass function in the u sual 
way b y translating the origin to [<j1,5 = B{mt)\ I L acev & Cold 
Il993h . We call this conditional probability viSl — 5c\5l — B). 
Then we use Bayes' theorem to findp((5^ — B\Sl — 5c), the prob- 
ability that a trajectory is part of an HII region of mass nib given 
that it is part of a collapsed halo with mass mh'- 

p(Sl = B\Si = Sc)piSl = Sc)dSt = 

PiSl = Sc\Sf = B)p{Sl = B)dSt. (3) 

The last three terms are all known, so the conditional probability 
is straightforward to compute. We can obtain the desired result by 
simply transforming to mass units: 
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where Bo is defined above equation J2} and is found from Bo = 
Sxim = oo). An implicit assumption behind equation J4j is that 
the density distribution p (5) on the relevant mass scales is gaussian. 
This approximation breaks down on sufficiently small scales, and 
we expect that early in reionization (when the HII regions are small) 
our formalism will be less accurate. (Note that in this regime our 
linear approximation for Sx is also only approximate.) 

We show the cumulative probabilities that galaxies of mass 
rrih = 10^ Mq and rrih ~ 10^^ M© are in bubbles with radius 
above some value in Figure|2|for the choice (^ = 40. (This choice 
leads to reionization at 2 ~ 11.) Each curve begins at the mini- 
mum bubble size to contain such an ionizing galaxy, (^rrih (i.e., the 
radius of the Stromgren sphere if the galaxy had no neighbors). We 
see that including neighboring galaxies makes a dramatic differ- 
ence to the host HII regions of these galaxies. In both cases, nearly 
half are in regions at least twice the radius of the naive estimate 
by the time xh ~ 0.5, and there is a significant tail extending to 
much larger bubbles even earlier. This is a good illustration of one 
of the principal conclusions of FZH04: the distribution of bubble 
sizes is determined not by the characteristics of individual galaxies 
but instead by the large-scale density field. The tail at large Ri, is 
more significant for smaller galaxies, because the most massive ha- 
los are less likely to have neighbors of comparable or greater mass 
so contribute relatively more to the local ionizing field. Also, note 
that the distribution clearly evolves rapidly throughout reionization, 
with the fastest evolution occurring in the regime xh ^ 0.5. 

Figure|2|reveals a minor inconsistency of our approach: the to- 
tal probability for a galaxy to be in an HII region with mb > C'^'i 
is not equal to unity, as required by our fundamental assumption 
(see iJ2j. Unfortunately, there is nothing in the excursion set result 
J4j to guarantee this. For example, imagine a trajectory that main- 
tains (5 ~ until m ~ rrih and then rises steeply to pass through 
(a1,5c). Such a trajectory is (by construction) part of a halo of 
mass m/i, but our formalism would assign it to an ionized bubble 
of mass rrib ~ rrih- Essentially this trajectory corresponds to an 
isolated halo that lacks any neighbors. Fortunately, these steep tra- 
jectories are rare, especially once the characteristic size of HII re- 
gions exceeds ("m^, (which ha ppens relatively earl y for most galaxy 
masses). We note that in the lLacev & Cold il993h merger tree pic- 
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Figure 2. The probability that halos with m/j = 10^ and 10^^ Mq are 
embedded in an HII region above a given comoving radius in the (^ = 40 
model (top and bottom panels, respectively). The curves correspond to: z = 
12 {xh = 0.26, solid), 2 = 13 (xh = 0.52, long-dashed), z = 14 
{xh = 0.7, short-dashed), 2 = 15 (xh = 0.81, dotted), and z = 16 
{xh = 0.89, dot-dashed). 



ture, these objects are halos that undergo no major mergers in the 
future, so they must be rare in a hierarchical structure formation 
scenario. The discrepancy is worst early in reionization, when 5^ is 
large and therefore more difficult to cross. Figure|2|shows that, even 
when Xh ~ 0.9, only ~ 15% of halos are assigned to ionized bub- 
bles whose nominal sizes are smaller than (mn- As expected, the 
fraction of these trajectories decreases rapidly and they can essen- 
tially be ignored once xh ^ 0.7. The discrepancy is also slightly 
smaller for massive galaxies because Sc — 5x is an increasing func- 
tion of mass (i.e., a trajectory must rise more steeply in order to 
be assigned to a small HII region). Physically, this is because more 
massive galaxies are also more highly biased. 

We have shown p(> Rb\mh) for only two masses in one par- 
ticular model of reionization. Although the bubble size distribution 
at a fixed xh is nearly independent of (^, the properties of the ioniz- 
ing galaxies clearly do change between different reionization sce- 
narios. Most importantly, the characteristic mass scale of the ioniz- 
ing galaxies decreases if reionization occurs earlier. Qualitatively, 
the distributions shown in Figure |2| would correspond to smaller 
galaxies if (^ > 40 and larger galaxies if C < 40. Note that the 
two masses we have selected are both relatively rare at 2; ~ 12-16; 
galaxies closer to the nonlinear mass scale would experience even 
larger relative effects from the HII regions of their neighbors. 



3.2 Observable Consequences 

In principle, one way to constrain the ionization state of the IGM 
is to measure the amount of absorption from neutral gas outside 
of a galaxy's HII region. The Hubble flow velocity offset between 
the galaxy and neutral gas implies that the Lya photons redshift 
out of resonance by the time they encounter neutral gas. The rele- 
vant cross-section is therefore the "damping wing" of the line pro- 
file. Assuming that the galaxy is at Zgai, with neutral gas between 
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zq and Zn, the optical depth raamp 
JMiralda-Escudell998h 



at Ao 
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Tdamp = 6.43 X 10 a;iirGp(2:gai) 
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where Ac = 1215.67 A, rGp(2gai) ~ 6.5 x 10^[(l + 2gai)/10lfff. 
is the Lya optical depth for a fully neutral IGM ( Gunn & Pete rson 
1 1965ft . xh is the neutral fraction outside of the HII region, and 



(5) 
e rsoiJ 



I{x) 



„9/2 



+ 9^7/2^9^5/2^3^3/2^9^1/2 



-5'"U 



l+x 
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Here we have assumed that zo < Zn < 2gai and made the high 
redshift approximation H{z) oc (1 + 2)^' ^. 

The distribution of bubble sizes given in equation ^4} can 
strongly affect the damping wing absorption of a galaxy's Lya line 
by pushing the neutral gas much farther from the galaxy than one 
would naively expect. By decreasing 2„, we increase the velocity 
separation between the emission line and the neutral gas, so that the 
galaxy line suffers less absorption. Neglecting peculiar velocities, 
the damping wing absorption depends only on the separation be- 
tween the galaxy and the edge of the HII region.* It is thus straight- 
forward to transform equation J4j into the distribution of Tdamp. 
The only subtlety is that the galaxy need not sit exactly in the cen- 
ter of its ionized bubble. We will assume for simplicity that galaxies 
are uniformly distributed throughout the HII region, with the only 
restriction that each be a distance i?miii from the edge. We set -Rmin 
equal to the radius of the galaxy's HII region if it were completely 
isolated, because its own photons guarantee that the galaxy sits at 
least this distance from the nearest neutral gas. Note that the ex- 
treme assumption that all galaxies sit precisely in the center of the 
ionized bubble changes our results by < 25% (much less in most 
cases). The true distribution will be somewhere in between these, 
because high-redshift galaxies are highly biased. 

Figure|3|shows the resulting distributions for the same galax- 
ies considered in Figure|2| In each case we scale Tdamp to the value 
it would have if the galaxy were completely isolated, riso; the cor- 
responding values at 2 = 12 are marked for the two halo masses. 
(Note that Tdamp is ~ 30% higher at 2 = 16; see equation (S).) To 
compute Tdamp, we have assumed that A2d = 2„ — 20 = 0.5 and 
Xh ~ 1 in this region. The true path length of neutral gas depends 
on the distance between individual HII regions and will evolve as 
the bubbles grow and overlap. We have ignored this effect for sim- 
plicity because it does not affect the main thrust of our argument. 
We consider a more detailed model in iJ4] Our choice here essen- 
tially maximises the damping wing absorption because gas beyond 
Azd = 0.5 is far enough away in velocity space to render its ab- 
sorption unimportant. The simplest refinement would be to scale 
Tdamp by the mean neutral fraction, xh (see i|4.2> . Note that we 
also neglect absorption by gas within the ionized bubble; we show 
below that this resonant absorption is subdominant at the line center 
(and especially on the red side) except when Tdamp ^ 1. 

The figure shows that there is a non-negligible probability of 
substantially reduced absorption even early on, when xh = 0.9: 



* We assume pure Hubble flow in our calculations. Infall onto collapsed 
halos inside the HII region 1 Barkana 2004- .Santos_20Q4.) should have only 
a small effect on the damping wing. 
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Figure 3. The probability that halos with m^ = 10^ M0 and m^ = 
10^^ Mq have a damping wing optical depth below some value in the 
(^ = 40 model. In each case we parameterise Tdamp in terms of the value 
it would have if only the galaxy's own HII region surrounded it, tisq. The 
curves correspond to: z = 12 (xh = 0.26, solid), 2 = 13 (xh = 0.52, 
long-dashed), z = lA (xh = 0.7, short-dashed), 2 = 15 {xh = 0.81, 
dotted), and 2 = 16 (xh = 0.89, dot-dashed). 



~ 20% of 10^ Mq galaxies have Tdamp < 15 and ~ 5% have 
Tdamp < 6. As described above, the relative effects are larger 
for smaller galaxies, because they are more likely to have mas- 
sive neighbors. Of course, the absolute Tdamp is still significantly 
smaller for larger galaxies. If plotted as a function of Tdamp, we 
note that the probability distribution becomes less dependent on 
galaxy mass once the characteristic bubble size exceeds Rmin- 

As emphasised above, the damping wing absorption depends 
on the IGM characteristics. If the IGM is partially ionized, either 
from an earlier episode of reio nization or from a smooth ionizin g 
component such as X-rays (e.g.. lOhl2001uVenkatesan et all200lh . 
Tdamp decreases for two reasons. Obviously, the intrinsic damping 
wing is weaker if the external IGM is partially ionized. This ef- 
fect could be accounted for by simply rescaling tiso in Figure |3] 
However, the HII regions will also be larger for a given ionizing 
luminosity, because fewer photons are consumed in ionizing each 
IGM volume element. As another example, we can consider "dou- 
ble reionization" scenarios. An early generation of sources can im- 
print a characteristic bubble size on the IGM that persists until the 
second generation of sou rces has produce d more ionizing photons 
than the first generation JFurlanetto et alJ l2004a). In this case the 
damping wing absorption would be determined by the relic bub- 
bles from the first generation. Thus the distribution of Tdamp can 
constrain complex reionization histories as well as the relatively 
simple ones we have explicitly considered. 

In principle, the damping wing absorption experienced by a 
galaxy inside an HII region could be modified by its peculiar ve- 
locity. However, this turns out to be unimportant: th e typical ve- 
locity of a halo at 2 ~ 10 is ~ 100 km s"^ 1 She th & Diaferid 
120011) . much smaller than the Hubble flow difference across its own 
Stromgren sphere. Actually, the relative velocity shift between the 
galaxy and the edge of the HII region is significantly smaller, be- 
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cause linear theory peculiar velocities are determined by the mass 
distribution on the largest scales and have fairly gentle gradients on 
Mpc scales. Peculiar velocities could be more important for reso- 
nant absorption, but nonlinear effects, including gas infall an d inter- 
nal m otions such as winds, are likely much more significant iSantoJ 
l2004 . 

We see that the distribution of damping wing absorption is 
broad and, especially during the middle and late stages of reion- 
ization, does not directly map onto the ionizing output of an em- 
bedded galaxy. Thus it appears dangerous to attempt to constrain 
the ionization state of the IGM outside of the Stromgren sphere 
by comparing the inferred absorption of the line to the estimated 
ionizing luminosity (even neglecting the many complications in- 
trinsic to the galaxy; .Santos, ■2004') . Instead, the galaxy's neigh- 
bors play a crucial role in determining the topology of ionized 
gas. Thus we suggest that attempts to infer the reionization his- 
tory throug h observations of individual high-redshift galaxies (e.g., 
iRhoads & Malhotr JbOOll: iLoeb et al. 2004; Cen et al. 2004) are 
highly uncertain. For example. IHu et al. (,2002") and .Rhoads et alJ 
y004|) claim that observations of strong Lya emission lines at 
z ~ 6.5 require the galaxies to be embedded in a mostly ion- 
ized medium. Our model predicts that a galaxy with rrih = 
10^° M0 has a (0.3%, 16%, 55%) chance of rdamp <2ifxH = 
(0.75,0.5,0.25) at 2 = 6.5. These become (1%,33%,73%) if 
m-h — 10^^ Mq. Thus, it seems likely that xh ^ 0.5, but stronger 
conclusions are currently unwarranted. 

On the other hand, our results suggest that searches for high- 
redshift Lya emitters can easily be extended to the era before reion- 
ization. While some fraction of galaxies will have their Lya lines 
completely absorbed by the surrounding IGM, many will retain 
strong lines that can be isolated in narrow-band searches. Most in- 
terestingly, the distribution of inferred damping wing optical depths 
yields a measurement of the sizes of HII regions, which in turn 
tells us about the morphology of reionization. Of course, we must 
account for effects intrinsic to the galaxy and its immediate envi- 
ronments before strong constraints will emerge. 

One potential difficulty with surveys beyond reionization is 
that cosmic variance could become extremely large, because the 
local overdensity of galaxies modulates the damping wing absorp- 
tion. We illustrate this point in Figure|4| We first make the simplest 
possible assumption that the Lya line luminosity is proportional 
to the galaxy mass, and we suppose that a given survey can probe 
down to a minimum mass rriobs, neglecting any IGM absorption. 
This is obviously a poor assumption, because galaxies of a single 
mass can have a wide range of star formation rates, wind veloci- 
ties, and intrinsic absorption, but it is probably not too bad in the 
statistical sense. We wish to find the fraction of galaxies with mass 
mh that would be observable by such a survey and how they are dis- 
tributed in space. Our assumptions require that a galaxy have damp- 
ing wing absorption Tdamp smaller than Tmax = \n[mh/mohs) in 
order to be in the survey. 

The dotted curv es in Figure |4| show the halo mass function 
JPress & SchechteMl 974). The short-dashed curves show the frac- 
tion of galaxies at each mass that are accessible to our example sur- 
vey, using the formalism described above. The solid curves show 
the number density of HII regions that have Tdamp < Tmax at their 
central points, while the long-dashed curves show the fraction of 
space filled by such regions, Q.^ The left panel shows results for 
two different mass thresholds, mobs ~ lO'' and 10^" M© (thick 
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Figure 4. Cosmic vaiiance in Lya selected surveys at 2: = 8. The dot- 
ted curves show the (logarithmic) halo mass function, mi^ni^imii). Short- 
dashed curves show the number density of halos that are visible given a 
minimum observable galaxy mass, mobs- Solid curves show the approx- 
imate number density of HII regions of which these observable galaxies 
can be pail, n(> m;,), while the long-dashed curves show the fraction of 
space filled by these bubbles (Q). The left panel assumes xu = 0.5, with 
rriobs = 10'' and 10^" Mq for the thick and thin curves, respectively. The 
right panel assumes rrioba = 10^ Mq, with x/^ = 0.75 and 0.25 for the 
thick and thin curves. 

and thin curves, respectively) if xh ~ 0.5 at z = 8. The num- 
ber of visible sources is small near the mass threshold, because the 
sources must be embedded in extremely large bubbles, and they are 
confined to rare regions of the universe (though the total number 
density is not small). As ruh increases, however, the fraction of vis- 
ible galaxies rapidly approaches unity and the fraction of the uni- 
verse through which they are distributed also increases. (Note that 
Q decreases for sufficiently massive galaxies because these galax- 
ies are themselves so rare.) This is because the intrinsic luminosity 
is so large that they are visible even when completely isolated. The 
right panel shows results for mobs ~ 10^ M© if xh ~ 0.75 and 
0.25 (thick and thin curves, respectively). Note the dramatic decline 
in the volume available to the survey when xh = 0.25; while some 
fraction of galaxies are still visible, they are widely separated and 
highly clustered. Large- volume surveys will therefore be required, 
especially in the earlier stages of reionization. 

Finally, to further illustrate the importance of the size distri- 
bution of bubbles, we now calculate some example emission line 
profiles. We will take as an illustrative example a galaxy with 
mh = 10^ M0 at 2 = 10. Given a bubble size, we compute the 
damping wing absorption from equation J5j. However, in this case 
we must also include the absorption by the (mostly ionized) gas 
near the galaxy. This gas is in ionization equilibrium, so (neglect- 
ing clumping) 



aBTn?H{z) = XHnH{z)a{l + z)^ 'l^ 



(7) 



Note that these are only approximate calculations of the available number 



of bubbles, because they require the galaxy to sit at the center of the HII 
region (instead of any location within the bubble). 
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where qb is the case B recombination coefficient, ct ~ 2 x 
10~^^ cvc? is the frequency-averaged ionization cross-section, R 
is the distance in comoving units, and A'^^ is the rate at which the 
central source produces ionizing photons. To compute this quan- 
tity, we include only the ionizing photons from the galaxy (which 
dominate the background at the relevant distances). We assume a 
star formation rate of 3 Mp; vr~^, with th e ionizing photon produc 



tion rate given by iLeitherer et alj {l999) for continuous star for- 
mation at Z = O.O5Z0 and with a Salpeter initial mass function 
and a low-mass cutoff oi M = 1 M© . We then allow a fraction 
/esc =0.05 of these to escape the galaxy. These parameters are of 
course arbitra ry, but they could d escribe the putative z = 10 galaxy 
discovered by Pello et alJ i2004l) . For reference, these nominal pa- 
rameters would correspond to ^ = 400/* /(I + Tircc), where /* is 
the nominal star formation efficiency and rircc is the mean number 
of times a hydrogen atom recombines. Finally, we assume that the 
line width is cr^ = Vc/\/2, where Vc is the circular velocity of the 
halo JBarkana & Loebll200ll) : this yields o-„ = 27 km s^^ for our 
parameters.. 

Equation yields the profile of xh near to the galaxy. To 
compute the resonant optical depth Trcs at a particular observed 
wavelength Aobs, we integrate the Lya absorption cross-section 
over this xh distribution, assuming pure Hubble flow for the veloc- 
ity structure. We assume a Voigt profile with T = 10* K for the ab- 
sorption cross section, as appropriate for ionized gas. We note here 
that our treatment of resonant absorption is highly simplified and 
neglects many important effects. For example, we have assumed 
the gas to be smoothly distributed at the mean density; in reality it 
will have a complicated density distribution. Lines of sight passing 
through overdense regions will have correspondingly more absorp- 
tion; however, most of space is filled by underdense gas and the 
mean resonant absorption is somewhat weaker. We estimate this ef- 
fect in i)4.3l below. We have also neglected infall and winds, which 
move the gas through velocity space and can in principle strongly 
affect the absorption (Santos 2004). Fortunately, neither of these 
effects change our qualitative result, so we will not discuss them 
further here. 

Several example line profiles are shown in Figure |5| along 
with the corresponding optical depths. The solid, long-dashed, 
short-dashed, and dot-dashed lines assume R^, = 10, 5, 3, and 
1 Mpc, respectively. (The last of these is nearly completely ab- 
sorbed by the damping wing and does not appear in the top panel.) 
In the top panel, the upper dotted curve shows the assumed intrin- 
sic line profile; the other dotted line shows the absorption profile 
for Ri, — 10 Mpc if we neglect resonant absorption. In the bottom 
panel, the dotted line shows Trcs- For reference, an isolated galaxy 
of this mass should have Rl''° ^ (C/160)^''^ Mpc. As a fiducial 
example, (^ = 12 implies xh = 0.5 at this redshift. Then a galaxy 
with ruh = 10® Mq would have a (75%, 40%, 20%, 3%) proba- 
bility of being in bubbles larger than (1, 3, 5, 10) Mpc according 
to our formalism. It is obvious that the large-scale environment of 
the galaxy dramatically affects its absorption properties. While the 
resonant absorption will suppress the blue side of the line, it has 
little effect on the red side, which allows an estimate of Tdamp- 



4 TRANSMISSION GAPS IN THE GUNN-PETERSON 
TROUGH 

We now consider whether HII regions can appear as gaps in the 
Gunn-Peterson troughs of distant quasars (unrelated to the ionized 
bubbles that we wish to observe). We will compute 
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Figure 5. Top panel: Example line profiles for a galaxy at 2 = 10. The 
upper dotted curve .shows the assumed intrinsic profile. The solid, long- 
dashed, and short-dashed curves place the galaxy in HII regions of comov- 
ing size ijj, = 10, 5, and 3 Mpc, respectively. The lower dotted curve 
shows the absorption profile for R;, = 10 Mpc if we neglect resonant 
absolution. Bottom panel: The solid, long-dashed, short-dashed, and dot- 
dashed curves show Tjiamp if the galaxy is in an HII region of comoving 
size Rf, = 10, 5, 3, and 1 Mpc, respectively. The dotted line shows Trcs, 
which is the same for all bubble sizes. 



dn{< t) 
dz 



dr 

dz 



dm n{m)-KRn^^^{m, r ) , 



(8) 



where all distances are in comoving units, dn{< T)/dz is the num- 
ber of transmission gaps with optical depths smaller than r per unit 
redshift, and n{m) is the mass function of ionized regions. i?max 
is the impact parameter from the center of the HII region at which 
the optical depth is r. Note that the total optical depth is twice 
the damping absorption (because both the blue and red sides con- 
tribute) plus the resonant absorption. We compute the minimum 
optical depth encountered within each HII region (i.e., when the 
line of sight passes closest to the center of the bubble). 

We first estimate the resonant absorption inside the ionized 
bubbles in i )4. II We then present the resulting absorption statistics 
in i)4.2l and estimate how the inhomogeneous IGM affects our re- 
sults in i|4.3l Finally, we discuss the spectrum of SDSS II 148-1-525 1 
inS31 



4.1 Resonant Absorption 

Figure |5| shows that resonant absorption is relatively unimportant 
for Lya emitters (at least on the red side of the line). This is because 
photons emitted on the red side never pass through the Lya reso- 
nance in the IGM and because the object we are observing creates a 
local, highly-ionized bubble around itself. The resonant absorption 
within this zone is suppressed because xh is much lower near the 
galaxy. However, for lines of sight on a random path through an 
HII region, the probability of passing so close to an ionizing source 
is small. Thus most will suffer substantial resonant absorption. 

To compute Trcs, wc therefore need to know the distribution of 
ionizing sources within the HII region. We will begin by examining 
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two simple limiting cases. In the end, we will show that they yield 
nearly identical results. First, let us assume that all of the ionizing 
sources are located in the center of the bubble. In that case, we can 
use equation Q once we have an estimate for N-y . Unfortunately, 
this factor does not follow directly from the formalism described 
in SJ2| Our model uses the total time-integrated number of ionizing 
photons (essentially our C, parameter) to compute the bubble sizes, 
while we need the instantaneous photoionization rate to compute 
xh{R)- For simplicity, we will assume that each HII region con- 
tains many sources, so that their fluctuating star formation rates 
average out. This is obviously only accurate in the late stages of 
reionization, when the bubble sizes are much larger than those of 
individual galaxies; fortunately, this regime will turn out to be the 
relevant one anyway. In this case, we can set 



N-, = 



C,mb 



coll 



for a bubble of mass rrib. Then 



Xh{R) 



3as 



aRtH{z) 



3.2 X 10" 



/coll 



/c< 



\dfcoii/dz 



\Rt) 

mi 



dfcoii/dz\ _ 

R_Y / 10 ^^ 
Rb) [l + ^ 
0.15 Ni/^ 



(9) 



(10) 



where the "c" superscript refers to the case in which all the ion- 
izing sources are at the center of the bubble. Note that the /coii 
factor is typically of order 3 at these redshifts. The resonant ab- 
sorption at the specified velocity is then t^cb ~ tgpXh and will be 
quite large except near the center of the regions. We have ignored 
one subtlety in equation J9j: in general (^ includes the mean num- 
ber of recombinations rircc in the IGM, integrated over the entire 
reionization history to this point. The instantaneous ionizing rate, 
of course, does not depend on the recombination rate in the past. 
Thus T^ oc (1 + riroc)^^; in a scenario in which recombinations 
have delayed reionization but the mean ionizing emissivity is high, 
this could have a substantial effect. 

As a second limiting case, we assume that the sources are dis- 
tributed uniformly throughout the bubble. In this case the flux of 
ionizing photons is also approximately uniform across the HII re- 
gion (non-uniformity occurs only because the bubble has a finite 
size); in equation 0, we have 



A^^ 



3N-, 



AitRI 



4nR^ 

The resulting ionized fraction is 



^H = x'ij{Rt)/3. 



(11) 



(12) 



We find that the resonant absorption is Trcs ~ 

10~^TGp{z){Mpc/Rt) at a typical position within a bubble, 
or Trcs ^ 60 for Rt = 10 Mpc. Thus the mean neutral fraction 
is large, so Trcs ^ 1 unless the line of sight passes close to an 
ionizing source. This has a fortunate consequence: the cross- 
section for a line of sight wit h Tree < to is independent of the 
number of sources (see also iBarkana |^02). Suppose that the 
bubble is made up of A'^ identical galaxies. From equation Q, 
the radius at which the optical depth equals a fixed value tq is 
.Rmax(To) OC l/\/iV. Thus the total cross-section through such 
regions is A*' x ivRmax = constant and is independent of the 
number of "subclusters" within the HII region. A similar argument 
shows that the cross-section also remains unchanged if only a 



fraction of the bubbles are "active" at a given time. Of course, if 
the galaxies or subclusters are distributed throughout the bubble, 
their damping wing absorption will change with distance from the 
edge. Because this is a relatively small effect, we will use equation 
<10> to compute the resonant absorption inside the HII regions. 

Note that the above treatment assumes a uniform density IGM. 
We will show that clumping actually increases i?max in i|4.3l 

Both of the above prescriptions neglect absorption of the ion- 
izing photons as they travel through the bubble. This requires the 
total optical depth to ionizing photons rion to be < 1. We find that 



TioniRb) = 



nH{z)aB 
OTzJH{z) 

/coll 



,fc< 



0.1 



rf/coll/rfz] 
1+2 



(13) 



M/coii/d^l / V 10 
/ 0.15 \^'- I iib 



1/2 



0.023 

in both the centrally concentrated and uniform limits. Thus, ne- 
glecting any dense clumps, the majority of ionizing photons reach 
the edge of the bubble without being consumed by recombinations. 
This is, in fact, a necessary assumption of the model described in 
!j2| Of course, if a large fraction of lines of sight pass through dense 
"Lyman-limit" systems before hitting the edge of the bubble (or in 
other words if the characteristic bubble size exceeds the mean free 
path between such systems), the HII region would no longer ex- 
pand and our model breaks down. The point at which this occurs is 
of course unknown; note that it could be quite early if "minihalos" 
are common iBarkana & Loebl2002h . 



4.2 The Abundance of Transmission Gaps 

We are now in a position to compute dn{< T)/dz from equation 
Js}. We calculate the resonant absorption via the centrally concen- 
trated limit of equation JlOt and Ti-os ~ xhtgp-^ For the damping 
wing, we must know the total path length of the neutral segment of 
gas (i.e., Zn -— zq in equation O). This is difficult to compute pre- 
cisely, because it depends on the large-scale distribution of the HII 
regions. We will take Aa^ = 0.5 and assume that the neutral frac- 
tion is xh within this region. We examine the importance of this 
prescription below (where we also show some examples of i?max). 
Figure |6| shows how the absorption statistics evolve through 
reionization in our model. The thick and thin curves assume C, = 12 
and 40, respectively. The differences between the two are relatively 
modest, despite being displaced in redshift by Az ~ 3, and occur 
because the higher cosmic densities in the (^ = 40 model make ab- 
sorption somewhat stronger. The curves are fairly flat until some 
characteristic optical depth is reached; this is because the bubbles 
themselves have a characteristic size. The curves flatten with time 
because resonant absorption becomes less significant as the bubbles 
grow and because the peak in the bubble size distribution sharp- 
ens with time. The expected number of absorbers is clearly quite 
small; even when xh ~ 0.1, only ~ 0.1 gaps with r <4 are ex- 
pected per unit redshift. However, in contrast to Barkana 1 2002), the 
probability is not vanishingly small unless xh ^ 0.5. Even a few 
quasars shortly beyond reionization could show some transmission 

® Note that we neglect the spatial variation of xu here. In principle, we 
should compute the mean value of xh smoothed across a thermal width 
of the line, but the width is usually small compared to the length scales of 
interest. 
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Figure 6. The number of transmission gaps per unit redshift dn/dz with 
an optical depth smaller than the specified value. The thick curves assume 
C = 12 and have xh = 0.12 {z = 8.5, solid), xh = 0.3 (z = 9, long- 
dashed), and X[j = 0.5 (z = 10, short-dashed), Xff = 0.66 (z = 11, 
dotted). The thin curves assume (^ = 40 at the same neutral fractions; these 
occur Az ~ 3 earlier than the thick curves. 



features. The difference arises because lSarkana (2002) considered 
isolated HII regions around galaxies, for which the damping wing 
absorption provides a relatively large "floor" in the optical depth. In 
our model, the damping wing absorption can be much smaller at the 
centers of large bubbles. Finally, we have set riroc = when com- 
puting the resonant absorption (see the discussion of equation 1101 ). 
If recombinations have significantly delayed reionization, dn/dz 
will increase approximately in proportion to 1/(1 + rircc), except 
at the bright end where the damping wing is important. 

We note here that it may be easier to detect regions of weak ab- 
sorption in the Ly/3 Gunn-Peterson trough, because rp — 0.16ra; 
the optical depth axis in Figure |6| can simply be scaled by this 
amount. We therefore expect a factor of several more such gaps 
near the end of reionization and orders of magnitude more dur- 
ing the early stages. Of course, the Ly/? region is blanketed by the 
lower-redshift Lya forest, which will cover some fraction of the 
transmission gaps (especially since the Lya forest should still be 
fairly deep at the relevant redshifts). 

Assuming a uniform density IGM, we can easily estimate 
the widths of the transmission gaps. In the regime in which res- 
onant absorption dominates, the optical depth will increase by 
a factor of two over a distance along the line of sight equal to 
the impact parameter: AA/Aobs ~ i?(2)_Rmax/c ~ 0.003[(1 + 
z)/10]'^'^(i?max/Mpc). Interestingly, the line widths can thus 
break the degeneracy between a small number of clustered sources 
and a large number of distributed sources. As described above, the 
total cross section does not change if we separate the central ioniz- 
ing source into A'' small clusters, because -Rmax oc N^^'^ . How- 
ever, each individual line of sight must pass closer to a source if 
clustering is weak, so the line widths would decrease. Similarly, if 
the central sources of HII regions are bursty, each "active" region 
would have a larger -Rmax, making the individual lines larger. Note, 



Figure 7. Top panel: The cross-section for bubbles. The solid curves have 
R}, = 15, 10, and 5 Mpc, from top to bottom. The long and short dashed 
lines modify the damping wing (see text) and have Ri, = 10 Mpc. Bottom 
panel: Density of transmission gaps. The lower solid curve is our default 
model, and the long and short dashed lines modify the damping wing. The 
dotted line assumes that the resonant absorber has A = 0.5. The upper 
solid line includes only absorption from the damping wing, while the dot- 
dashed line includes only resonant absorption. All curves (in both panels) 
assume C, = 12 and are at xjj = 0.3 (z = 9). 



however, that the optical depth gradients are much smaller if Tdamp 
is important. 

Figure shows how our results depend on some of our in- 
put parameters. The top panel shows i?max if C — 12 and 2 = 9 
(at which time xh = 0.3). The three solid curves correspond to 
Rb ~ 15, 10, and 5 Mpc, respectively. In all cases, the impact pa- 
rameter must be a small fraction of the total bubble size in order for 
the local ionizing background to be large enough to reduce the res- 
onant absorption. The ratio Rmax/Rb increases with Rb both be- 
cause the damping wing is farther away and because the source den- 
sity increases. These curves change weakly with redshift because 
of the increased absorption in a denser universe. The other curves 
show how the results change with different assumptions about the 
damping wing. To construct the long-dashed curve, we assumed a 
fully neutral medium with Azd ~ 0.5. The larger Tdamp induces 
a more significant cutoff at the bright end, but the effects are not 
large at moderate optical depths because resonant absorption dom- 
inates. The sho rt-dashed curve uses a prescription similar to that of 
iBarkanal i2002ft . We let duiou/dz be the number of ionized bubbles 
intersected per unit redshift; this is the same as equation Jsj except 
that i?max -^ Rb- The mean path length of a neutral segment (in 
redshift units) is then given by xh/Az^ ~ driion/dz. The damp- 
ing wing absorption declines in this model because the path length 
between ionized bubbles is small. However, when xh is small the 
prescription makes only a small difference because the damping 
wing is unimportant anyway. 

The bottom panel of Figure shows the effects on the trans- 
mission gap statistics. The lower solid curve is the normal model 
of Figure |6| while the dashed curves make the same assumptions 
as in the top panel. The effects on the gaps are clearly modest, ex- 
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cept at the bright end. However, note that our assumptions about 
Tdamp can be much more important for xh ^ 0.5, because even at 
that time dn/dzio-a can be relatively large (although the path length 
through each ionized bubble is small). Nevertheless, the number 
of observable transmission gaps is extremely small in this regime, 
even in the most optimistic scenario. 

The upper solid curve is identical to our fiducial model ex- 
cept that we include only the damping wing component in the 
absorption. The amplitude is much larger in this case: as argued 
above, resonant absorption is the principal limiting factor in find- 
ing gaps. Comparing the small r tails of the distributions, we see 
that the cutoff in the number of absorbers is fixed by the "floor" 
in damping wing absorption. In contrast, the dot-dashed curve in- 
cludes only resonant absorption. The number density is not far from 
the full model (except at the bright end), indicating that this quan- 
tity controls the gap abundance. This has important implications 
for interpreting quasar spectra in light of complex reionization his- 
tories. Figure |6| assumes a simple model with a single generation 
of sources. The dominance of resonant absorption implies that the 
statistics will be qualitatively unchanged for more complex mod- 
els, although we would expect a substantial increase in the number 
of nearly transparent gaps (with r < 1). For example, if the ex- 
ternal IGM is partially ionized, rdamp will decrease and the bub- 
bles will be larger for a given ionizing emissivity. However, the 
resonant absorption within the bubbles will remain the same, and 
the dot-dashed curve shows an upper limit on the resulting num- 
ber of gaps. Similarly, although an early generation of sources can 
increase the bubble size, it does not affect the resonant absorption. 
Thus we expect that dn/dz places stronger constraints on the in- 
stantaneous ionizing emissivity than on the total ionized fraction, as 
long as the HII regions are large enough to reduce rdamp to a sub- 
dominant role. This could be especially interesting when combined 
with other methods (such as Lya emitters or 21 cm tomography) 
that are more sensitive to the total ionization history. 

4.3 The Effect of an Inhomogeneous IGM 

The most important simplification we have made is to neglect 
the density distribution of the IGM. On the one hand, small-scale 
dumpiness increases the mean recombination rate and hence the 
resonant absorption inside bubbles. On the other hand, underdense 
regions with smaller optical depth fill most of the volume, so the 
majority of any line of sight is likely to pass through regions with 
smaller absorption. As a quick illustration, the long-dashed curve 
in Figure Q shows how the results change if we assume that the 
gas causing resonant absorption has A = 0.5, where p = Ap. 
Equation shows that in ionization equilibrium the local neutral 
fraction is proportional to the local density, and the optical depth 
has an extra power of density. Thus Trcs oc A^ at a fixed distance 
from the center of the bubble. This is close to the behavior we find 
in Figure at moderate r, though the amplification is smaller at 
r < 1 because the damping wing becomes important. 

This is only the crudest estimate of the importance of den- 
sity fluctuations. To do better, we must average the magnitude of 
absorption over the density structure of the IGM: we wish to com- 
pute Tcff = — ln{e~^) rather than simply (r) as we did above. 
The best way to approach the problem is with numerical simula- 
tions, but these are computationally expensive and cannot yet be 
performed with the necessary dynamic range. Here we take an ap- 
proximate approach. We wish to estimate the fluctuations in the 
density field when smoothe d along the line of sigh t over a thermal 
width of the absorbing gas. iMiralda-Escude et alJ yOOOf) showed 



that the volume-averaged probability distribution of gas overden- 
sity in numerical simulations at 2 ~ 2-4 is well fit by 
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when smoothed on the Jeans mass of the photoionized gas. The 
form is motivated by interpolating between linear theory and the 
growth of voids and halos in th e nonlinear regime. The par ameter 
5q describes the linear regime: IMiralda-Escude et alj yOOO) find 
that it is given by So = 7.61/(1 -|- z). The exponent (3 depends on 
the density profile of collapsed objects; we will set 13 — 2.5, the 
value it takes for isothermal spheres. We then fix the remaining pa- 
rameters A and C by normalizing the mass and volume-weighted 
probability densities. Given the density distribution, we can com- 
pute the effective volume-averaged optical depth, t^b, correspond- 
ing to a nominal optical depth for the mean density IGM, Tmoan, 
through 



(e ") = / dAPy(A)exp(-rmcanA''). 



(15) 



Unfortunately, ther e are several re asons why this treatment is 
only approximate. First. lMiralda-Escude et al. | 2000) used simula- 
tions with flm ~ 0.4, Qa = 0.6, and as = 0.79, so there should 
be minor differences with the cosmology we have chosen. More- 
over, the simulation did not resolve fluctuations on small scales: 
while the Jeans mass at T = 10* K is resolved, fluctuations on 
smaller scales are not. Such small scale dumpiness is probably not 
important at z ~ 3, for which the simulations were designed, be- 
cause the thermal pressure of the ionized gas would have smoothed 
any density fluctuations. However, it may underestimate the fluc- 
tuations in recently ionized gas that has not yet relaxed into pres- 
sure equilibrium. Minihalos are an important example that could 
drastically i ncrease the mean recombination rate until they pho- 
toevaporate I'Barkana & Loetl l20o3: IShapiro et alj|2004 . Further- 
more, equation <14t smooths over the Jeans scale, which is slightly 
smaller than the thermal broadening width relevant for absorption 
studies. (However, we should also perform the average over one 
dimension rather than three.) Finally, equation <14t does not take 
into account the environment of the volume element under consid- 
eration. For resonant absorption, we are primarily concerned with 
regions that are near galaxies or groups of galaxies; we would there- 
fore expect that the relevant regions would be less likely to contain 
a void. It is not dear which of these effects dominate, so we sim- 
ply take equation <14t at face value and caution the reader that our 
results are only approximate. 

We show the results of equation <14t in the top panel of Fig- 
ure|8|for three different redshifts, z = 9, 10, and 11 (from bottom 
to top). We see that Tcs decreases slowly as redshift decreases be- 
cause structures condense and voids fill up more of the volume. 
The decrease in apparent optical depth is a factor of ~ 3-4 at mod- 
erate r. The difference becomes negligible for small t (and dose 
enough to zero, we even have Tcb > Tmcan) as the extra dumpi- 
ness becomes important. The correction to r is significantly smaller 
than that of lHaimanI i2002ft . who assumed a much clumpier density 
distribution. 

The bottom panel of Figure |8| compares the dn/dz distribu- 
tions for a uniform IGM (thick lines) with those using Toff (thin 
lines). As expected from Figure Q the differences are substantial 
(up to an order of magnitude at r ~ 10), although they become 
much less important at small optical depths. On the other hand, 
the number of transmission gaps remains small but non-negligible, 
so our fundamental conclusions are not affected. Additionally, we 
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Figure 8. The top panel shows the ratio of r^ff = — In(e~'^ ) to the optical 
depth of the mean density IGM, r, at z = 9, 10, and 11 (solid, dashed, 
and dotted curves, respectively). The bottom panel shows the number of 
transmission gaps per unit redshift dn/dz. The thick curves assume that 
all gas has the mean density, while the thin curves account for the density 
distribution of the gas as in the top panel. AU the curves assume C, = 12, 
which has xh = 0.3 (z = 9), xh = 0.5 (z = 10), and xu = 0.66 
(z = 11). 



emphasise that our method may miss extra small-scale dumpiness 
and the environmental bias. We conclude that transmission gaps are 
likely to be rare before reionization, but their appearance near the 
end of overlap is by no means impossible. 

We note that the density distribution does not have a strong 
effect on the damping wing, because equation J5j averages over 
a large path length through the IGM. Also, the Toff curves in the 
bottom panel of Figure |8|cannot be scaled to t/j, because the cor- 
rection in equation <15> must then be evaluated with t/j, mean in the 
exponent. 



4.4 The Transmission Gap in SDSS J1148-I-5251 

To date, SDSS has found fou r quasars with apparent Gunn-Peterson 
troughs, all at z > 6.2 JFanetalJl2002 ; iBecker et alj 1200 it 
iFan et alj|2003l: IWhite et alj|2003l iFan et alJ l2004V Toeether. the 
four troughs have a total path length of Aa ~ 1. SDSS Jl 148-1-525 1 
has a transmission spike at 8609 A(z = 6.08 for a Ly g line) with 
an ap parent r ~ 2.5, as well as a matching Ly/3 peak iWhite et alJ 
|2003)- White et al. point out that such a gap is not hard to imagine 
if one includes only the damping wing; however, they neglected 
resonant absorption. Because it treats each galaxy in isolation, the 
model o nBarkanal i2002h essentially requires that this gap appears 
only well after overlap. 

As we have emphasised, our large HII regions reduce the 
severity of the damping wing and allow transmission spikes near 
the end of overlap. Figure|9|shows the expected transmission statis- 
tics for Xh = 0.1, 0.15, 0.2, and 0.25 (top to bottom) at z = 6.1. 
We have used our standard assumptions plus the inhomogeneous 
IGM model of i|4.3l to construct the curves. We see that dn{T < 
2.5)/dz ~ 0.25 — 0.5 for these neutral fractions. This is entirely 



Figure 9. The expected number of transmission gaps at z = 6.1 if the 
universe has xjj = 0.1, 0.15, 0.2, and 0.25 (solid, long-dashed, short- 
dashed, and dotted curves, respectively). We have used the inhomogeneous 
IGM model described in i|4.3lto construct the curves. 



consistent with the observed gap; thus we argue that its existence 
does not require that overlap occurred before z — 6.1. Of course, 
it is still perfectly consistent with such a scenario. 

We caution the reader against overinterpreting these results. 
Uncertainties about the true distribution of sources and the den- 
sity structure of the IGM make a quantitative comparison difficult 
at present (especially with the small observational sample). More- 
over, our model probably begins to break down near overlap: by 
this point, the bubbles are so large that small scale structure and 
Lyman-limit systems will play an important role in determining the 
ionizing background. 

Finally, the same quasar spectrum shows other Ly/3 transmis- 
sion gaps that could be compared to our model. However, the line of 
sight also contains a CIV absorber al z = 4.93, and s ome of these 
spikes are likely emission lines from the same system iWhite et alJ 
|2003). More data about which gaps are truly due to the interloper 
are crucial for extracting the most information possible from this 
spectrum. 



5 DISCUSSION 

We have considered two methods to study the topology of reion- 
ization through near-infrared observations of high-redshift sources. 
The most promising technique is to measure the effects of damp- 
ing wing absorption on the Lya emission lines of high-redshift 
galaxies. Narrowband searches for galaxies with strong lines are 
one usef ul technique, and these survey s currently ex tend to z ~ 
6.5 (e.g.. liiu et al]|200l iKodaira et aljl200 3 : Rhoad s et alJl2004 
Stanwav et a l. 2004b; Santos et alJI2004ft . They become increas- 
ingly difficult at higher redshifts because of the sky background, 
but some narrow transmission windows do remain in the near- 
infrared, and we expect high-redshift gal axies to have strong intrin- 
sic Lya lines (e.g.,|^rtridee & Peebl e^l 1 967t iBarton et al. 200^). 
Of course, the technique need not rely on galaxies selected in this 
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way and can be appli ed to broadband photometric dropouts (e.g., 
IStanwav et alj|2004al) as well as m ore specialised selection tech- 
niques (e.g., gravitational lensing, iKneib et al.ll2004l : IPello et alJ 
[2004 ). 

Our results have several implications for such surveys. First, 
Lya selection should be able to extend to eras well before reioniza- 
tion. Although at these times many galaxies will suffer significant 
damping wing absorption, reducing the apparent strength of their 
Lya emission lines, a substantial fraction of galaxies will sit inside 
of large HII regions that allow transmission of a significant amount 
of flux in the Lya line even when xh ~ 0.75. We thus expect 
that the number density of narrowband-selected galaxies will de- 
cline with redshift but that they will not disappear entirely. Second, 
because of the wide variance in IGM absorption for galaxies with 
the same intrinsic properties, we find that a single galaxy cannot 
provide tight constraints on either the ionization state of the IGM 
_or the properties of the host galaxy (Hu et al. 2002; Rhoads et al. 
r2004lLoeb rta l. 2004: Ri cotti et alj2004lCen et al. 2004). Includ- 
ing the galaxy's neighbors can make a dramatic difference to the 
expected level of inferred absorption. For example, massive galax- 
ies at z = 6.5 have a > 20% probability of sitting in HII regions 
with Tdamp < 2 even if xh ~ 0.5. Fortunately, the most massive 
galaxies generally dominate their local HII regions, so in that case 
such inferences will not be as dangerous (see Figure |3j. Finally, 
our model also shows that cosmic variance in Lya-selected surveys 
will be even greater than expected, because the local overdensity 
of galaxies modulates the damping wing absorption (see Figure|4}. 
We predict that, although some galaxies remain visible if xh ^0.5, 
they are confined to increasingly rare regions of the universe. 

On the other hand, the wide range of damping wing absorp- 
tion expected through the middle ranges of reionization allows us 
to constrain the size distribution of HII regions during this era. This 
in turn will teach us about the properties of both the IGM and the 
high-redshift sources. Interestingly, because partial uniform reion- 
ization (such as from X-rays; Oh 2001; Venkatesan et al. 2001) 
and double reionization (e.g.. iWvithe & Loedl2003l ICeril2003bl: 
iHaiman & Holdei..2003) affect the bubble sizes, such surveys can 
also help to constrain these models. Ideally, such a survey would 
use galaxies that are not selected through narrow-band imaging to 
avoid the selection effect described above, but for the time being 
Lya line surveys look to be the most promising technique (at least 
for large cosmological volumes). Of course, we must separate the 
damping wing absorption from a variety of other complications, 
including gas infall, small-scale dumpiness, and galactic winds 
(Santos 2004) before strong constraints can be made. With large 
samples, this should not be too difficult because we do not expect 
these local features to differ substantially across the large-scale en- 
vironments that determine the topology of ionized bubbles. 

A related technique is to measure the damping wing absorp- 
tion from a gamma-ra y burst afterglow (.Miralda-Escude. . 1 998t 
iBarkana & Loebl l2004h . The advantage of these objects is that 
their smooth power-law spectra (and lack of Lya emission lines) 
makes interpreting the damping wing much simpler. Follow-up 
spectroscopy could then measure the host galaxy's redshift and 
comparison to the damping wing would yield the size of the HII re- 
gion. These host galaxies would be selected in an unbiased manner 
relative to the Lya absorption, which would make their interpreta- 
tion easier. 

Of course, our model is only approximate. Numerical simu- 
lations are required to address questions such as the anisotropy of 
Hn regi ons, the resonant absorp tion, and the effects of peculiar ve- 
locities. |Gnedin.&Prada| y004|) have already made some steps in 



this direction; however, our model suggests that their box (with size 
8^~^ comoving Mpc) is probably too small to obtain precise mea- 
surements, and in any case they were only able to examine one 
specific reionization history. 

A second method to study the topology of reionization is to 
examine high-resolution quasar absorption spectra, which could 
contain gaps in the Gunn-Peterson trough when the line of sight 
passes close to the center of a large HII region. Previous studies 
had suggested that the strong damping wing absorption by the sur- 
rounding neutral IGM makes such gaps extremely rare. We have 
shown that the clustering of high-redshift galaxies makes the ion- 
ized bubbles much larger than naively expected, reducing the con- 
tribution of the damping wing. In this case, gaps are still expected 
to be rare because residual neutral gas (even with xh ~ 10^'') can 
provide substantial resonant absorption; late in reionization we ex- 
pect dn/dz ~ 0.02-0.3 for features with r < 2. However, weak 
transmission gaps are not so rare that their presence automatically 
implies an earlier episode of reionization. In partic ular, the gap 
at z = 6.08 in the spectrum of SDSS Jl 148-1-5251 iWhite et alj 
2003) does not necessarily imply an earlier overlap epoch (though 
of course it is consistent with such a scenario). It is encouraging 
that existing and near-future observations can help to constrain the 
topology of reionization. Because this technique relies on active 
ionizing sources to eliminate the resonant absorption, it is not terri- 
bly sensitive to complex reionization histories (which mostly affect 
the damping wing). Large HII regions already reduce the damp- 
ing wing to manageable levels. Instead quasar absorption spectra 
are most sensitive to the instantaneous ionizing rate. We have also 
shown that the number of transmission gaps is nearly independent 
of the clustering and burstiness of ionizing sources. However, the 
widths of the gaps do vary with these quantities, so measurements 
of their sizes will help to constrain these parameters. 

The principal source of uncertainty in this calculation is the 
inhomogeneous density distribution of the IGM, which we have 
accounted for only approximately. Unfortunately, the dumpiness 
of the IGM is difficult to quantify analytically. Numerical simula- 
tions can do much better, but we note that resolving the small scale 
density structure is particularly difficult in this case because the 
transmission gaps occur in the rare, large HII regions with sizes 
> 10 Mpc (comoving). A complete solution would therefore re- 
quire a large box and high mass resolution. Moreover, our model 
for the HII regions is only approximate and breaks down somewhat 
before overlap, once the mean free path of a photon is determined 
not by the bubble size but by small-scale overdensities (i.e., Lyman- 
limit systems). The topology of reionization during this era is best 
addressed through numerical simulations. 

We also note that transmission gaps can occur if the line 
of sight passes near to a bright quasar (Miralda-Escude 199g; 
iMiralda-Escude et alj|200(j) . Quasars produce such strong ioniz- 
ing radiation fields (iV- y > 10^^ photons s~^ for the bright SDSS 
quasars, assuming the lElvis et al J 119941 template spectrum) that 
Tros is small within several comoving Mpc of the quasar. Esti- 
mates of the number density of gaps are difficult because they 
depend on the unknown luminosity function of fainter quasars. 
The best current estimate for quasar s with M1450 < 26.7 is 
WQSO ~ 6 X 10"^" Mpc~^ at z ~ 6 jFan et all2004l) . For these 
quasars, we then have dn/dz ~ 10~* for t < 1 even if we neglect 
damping wing absorption. Thus the currently observed population 
should provide a negligible number of intersections. Extrapolating 
to fainter quasars is dangerous given the lack of observational con- 
straints; h owever, if the lu minosity function has a power law slope 
/3 ~ — 3 IFan et alJl2004h at the bright end, this steep component 
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would have to extend to quasars a factor of > 100 fainter than the 
observed population in order for gaps due to quasars to become 
comparable to our predictions for xh < 0.3. This would require 
the break luminosity to occur at sign ificantly smaller luminosities 
than observed at z ~ 2 JBovle et alj2 000V Note also that the num- 
ber density of quasars declines rapidly with redshift (it drops by a 
factor ~ 30 from z = ?> Xo z = &; Fan et al. 2004), so quasars 
likely become even less significant at higher redshifts. Finally, be- 
cause quasar HII regions are so large, the gradients in resonant ab- 
sorption are relatively gentle and gaps near quasars should be quite 
wide in velocity space. 

In summary, we find that both detailed studies of the absorp- 
tion suffered by the Lya emission lines of star-forming galaxies 
and searches for transmission gaps in the spectra of high-redshift 
quasars will enable us to constrain the size distribution of HII re- 
gions in the middle and late stages of reionization. Both of these 
methods are already able to probe the tail end of reionization at 
2: ~ 6, and future surveys hold great promise to teach us even 
more. 
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